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Preface 



The purpose of these notes is to introduce a group of subjects in out of equihbrium 
statistical mechanics that have received considerable attention in the last fifteen years 
or so. They are mostly connected with time-reversibility and its relation to entropy, 
are expressed in terms of large deviations, and involve at some level the notion of 
timescale-separation. 

We shall consider systems that contain, in their dynamic rules, an element of noise. 
There are good reasons for this: On the practical side, a driven system needs to dis- 
sipate heat if it is to reach a stationary regime rather than heating up indefinitely. 
Stochastic systems, where energy is provided by the bath through time-dependent, 
random forces, are well-studied physical models of heat reservoirs. A second, equally 
important reason is that very often, dynamical systems in the presence of noise are 
much easier to study than purely deterministic ones, because then very subtle er- 
godicity considerations become trivial. If the aim of crgodic theory is to understand 
how randomness arises from deterministic constituents, once stochasticity is added 
'by hand' the question is artificially bypassed. One may then concentrate on the issues 
that are specific to non-equilibrium systems with many degrees of freedom, just as 
one postpones ergodicity questions in the day to day practice of equilibrium statis- 
tical mechanics. One last consideration is that even purely deterministic systems are 
sometimes more clearly understood as a small-noise limit: This stochastic stability 
approach is very natural and appealing, not only from the physical, but also from the 
mathematical point of view (sec e.g. (jCowieson et al 2005P ). 

Out of equilibrium statistical mechanics is a domain shared between theoretical 
physicists, mathematical physicists and probabilists, a fact reflected by a severely 
fragmented literature Workers in each one of these fields have in mind a different 
network of relations between subjects and techniques. Two ideas that look similar to a 
physicist may seem very distant to a probabilist, and vice-versa. The physicist's point 
of view - the one I adopt ~ stresses the relations between dynamics and the statistical 
mechanics in space-time, between stochastic evolution and quantum mechanics, and 
is on the alert for hidden symmetries and for scaling. 

The first lecture introduces stochastic dynamics in a formalism that uses as much 
as possible the analogy with quantum mechanics, on the assumption that the reader 
is already familiar with Shrodinger's equation. Next, we discuss the consequences of 

have f ollowed, as general references, the following: the books by Risken IIRisken 198411 . Gar- 
diner IIGardi ncr 19831 and Van Kampen l |Van Kampen 1981( for the general stochastic context. 
Parisi's book flParisi 1988[l in several places, in particu lar the relation between sochastic and quan- 
tum mechanics. Zinn-Justin's book l lZinn-Justin 19961 for technical background on path integrals 
and a more field-theo retic point of view. I have als o found very illuminating the Lecture notes of 
H. Hilhorst l|Hilhorst| l (in French) and of J. Cardy | |Cardy 1999[ l, whe re the reader may bridge the 
main gap of these lectures: renormalisation. The review of Hanggi et al | |Hanggi et al 1990^ has a very 
comprehensive view of activation processes. 



vi Preface 



time- reversibility (in particular detailed balance) , and how this is intimately related to 
thermodynamic equilibrium. Crucial for these notes is the fact that the term respon- 
sible for the breaking of a time-reversal symmetry in an out of equilibrium system is 
directly related to entropy production. In lecture three we discuss timescale separation 
and metastability. In particular, we present through an example a general formalism 
for metastability (jGaveau et al 19981 IBovier et al 2000p based on the spectral decom- 
position of the dynamical operator. It is quite elementary and intuitive, and is unjus- 
tifiably little known. We also describe very briefly the hydrodynamic limit, mainly to 
present an example where fluctuations become weak through coarse-graining, rather 
than through low temperatures. This allows us to carry over the 'low noise' results, 
that we introduce for simplicity in the case of low temperatures, for smooth, large- 
scale fluctuations: this is the Macroscopic Fluctuation Theory (jBertini et al. 200ip . In 
lecture five we present two forms of large-deviations: i) Low- Noise: we stress as much 
as possible the complete analogy between the 'Freidlin-Wentzell' (jFreidlin et al. 19841 
IKitahara 1975P theory and the standard WKB semiclassical approach of quantum me- 
chanics, ii) Deviations of long-time averages: In the context of glasses this is refrerred 
to as Space-time Thermodynamics (jJack et al. 2006]) ). because the long-time large de- 
viation functions are in complete analogy with (d+ 1)- dimensional thermodynamics. 
We shall show that the phase transitions encountered in systems with slow dynamics 
within this formalism are closely related to the spectral manifestations of metasta- 
bility discussed in lecture three. With all these elements in hand, and playing with 
the time-reversal symmetry and its breaking, we obtain the Fluctuation Theorem and 
Jarzynski's equality, the subject of the last lecture. 

The main aim of this short course is to stimulate curiosity. If it feels incomplete I 
will judge it successful. 
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1 

Trajectories, distributions and path 
integrals. 



In this section we introduce equations of motion containing a deterministic part, and 
a stochastic thermal bath. Next, we make the passage from a description in terms of 
trajectories to one in terms of distributions. The evolution of 'probability clouds' in 
space is formally very close to (imaginary time) quantum mechanics. We do our best 
to exploit this analogy as much as possible because it opens the way for the application 
of all the methods in quantum mechanics and field theory. 



1.1 From trajectories to distributions 

1.1.1 Trajectories 

Let us start by considering a system satisfying Hamilton's equations. In addition, we 
shall allow for the possibility of external forces that do not derive from a potential 
acting on the system, throughout this work we shall denote them f (q). Nonconservative 
forces do work, and tend to heat the system up. If we wish that the system eventually 
become stationary we need a thermal bath to absorb energy, both theoretically and 
in practice. The simplest thermostat is the Langevin bath, consisting of friction term 
proportional to the velocity and white noise. The equations of motion read: 

' (?) = rr^ = ... unless otherwise stated here — — — Vj 

Forcing Thermal Bath 

we shall use throughout Vi = Pi/m. If the thermal bath is itself in equilibrium, noise 
intensity and friction coefficient are related by: 

(ry,(t)?7,(t')) =27T%<5(t-t') (1.2) 

which, as we shall see, allows the system to equilibrate at temperature T in the absence 
of forcing. The parameter 7 measures the intensity of coupling to the bath. In an 
unforced system it does not affect the stationary distribution - the Gibbs-Boltzmann 
distribution ~ e~^^ - but it does control the dynamics: compare for example a system 
of particles interacting with a potential V in equilibrium with a medium at temperature 
T, when the medium is made of air or of honey (small and large 7, respectively). 
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Equation can be justified in a number of ways. In section 2.2.1 we sliall see 
liow this can be done. 

Let us now consider tlie overdampcd case of large 7. We can formally (and some- 
what dangerously) neglect the acceleration term as follows: 

dV 

miji + -fQi + h = ni{t) 

oqi 

+ ^ + = ?^(t) (1.3) 
In the last step we have rescaled time as 7r = t, 

which yields {rji{T)rjj (t')) = 2T6ij6{T — r'). A few things to note in the passage to the 
overdamped equation ()1.3p are: 

• We now have half the dynamic variables. 

• The velocity qi is now discontinuous in time, as is the noise itself. We have to 
be careful what we mean by (|1.3|) . for example if we are going to programme it 
in a computer. We adopt the Ito convention., which means, for example in one 
dimension: 

q{t + 5t) - q{t) + 5t {V + f){q{t)) = {St)^/^r^{t) (1.4) 

The simplicity comes from the fact that the force is evaluated in the old time t, 
and does not anticipate the result at the new time (i + St) ( [Van Kampen 19811 
IGardiner 1983, RiskcnTOSil [Hilhorstp . 

• As we shall see, these ambiguities^ disappear when we consider the evolution of 
distribution functions instead of the trajectories. 

• Because the velocity is discontinuous, so is the quantity J^i fiii'- neglecting inertia 
makes power become a subtle business. Indeed, some derivations, in particular 
involving work, become more transparent keeping inertia, which makes velocities 
a smooth function of time. 



1.1.2 Distributions 

Let us now change point of view, and consider the system rather than from the point 
of view of individual trajectories with particular noise realisations, as a 'probability 
cloud' evolving in space. The passage from the former to the latter description can be 
done in several ways, and it is instructive to see their relations. 

Consider the Langevin equation p.3p . We wish to obtain the equation of motion 
of the probability P(q, t). Let us first do it separately for a process in the absence of 
forces, and for a process of advection in the absence of noise. We obtain respectively: 



Which arc the analogue of factor-ordering ambiguities in quantum mechanics 
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9i 



dP 

dt 



dV 



dP 
dt 



I dqi 



dV 



P 



The first equation is just diffusion. Tlie second uses tlie famiiiar fact that if a distri- 
bution is carried by a flow q,; = gi(q), its evolution is given by the advective derivative 

A useful remark: A trick one impUcitly uses often in computer simulations is that 
whenever two processes act simultaneously and their effect in a small time-interval is 
small, one gets the same result by alternating short intervals with each acting alone. 
If the separate evolutions are ^ = ~HiP and ^ = —H2P, we can with the same 
argument recompose this as ^ = {Hi +H2)P. Applied to the previous situation, this 
means that the evolution of the probability for the full Langevin equation (jl.3p is: 



dP 
dt 



d 
dq. 



^ dq. 



dV 



p 



Pici,t) ^ -HFpP{(l,t) 
where we have defined the generator: 



d 



dqi dqi 



^FP - - 

Writing (jl.5p as a continuity equation, we identify the current 
P = — div J with the definition Ji(q) 



P 



(1.5) 



(1.6) 



(1.7) 



Exactly the same procedure can be used to derive the evolution of the probability 
for the case with inertia (jl.l[) . In order to split the system in a diffusion and an 
advection term, we need to work in phase space. The result is the Kramers equation: 



with 



P(q,p,t) = -i/KP(q,p,t) 
dpidqt dqi dpi ^ dpi \ dpi m 



: bath 



(1.8) 



(1.9) 

Hj : forcing 



(summation convention), where we recognise the Poisson bracket associated with 
Hamilton's equations, plus a bath, and (eventually) a forcing term. 
Again, writing the Kramers equation as a continuity equation: 



dPiq.P,t) 
dt 



= -HkP{ci, P, t) = -divJ = -Y, 



dJ„. 



dJp, 
dpi 



(1.10) 
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we identify a current: 



J. = |[p(q,P,0 J.=-(7T|-+7|[ + f + /.)mP,t). (1.11) 



A surprise is that, even in an unforced, f = equilibrium stationary state the phase- 
space current is nonzero. This suggests that we define an alternative quantity, the 
reduced phase- space current as (jTailleur et al. 2004P : 



The currents (jl.lip and (|1.12p differ by a term without divergence, and hence their 
fluxes over closed surfaces coincide. The interesting property of (|1.12p is that it is zero 
for the canonical distribution, as one can easily check. Furthermore, in a case with 
metastable states it is small everywhere, and it is concentrated along reaction paths. 
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Gaussian thermostat 

In some situations one wishes to study a system with thermostat that pre- 
serves the energy. This can be done with a deterministic (noiseless) 'Gaus- 
sian' (|Hoover 1986P thermostat, extensively used in the context of entropy pro- 
duction and the Gallavotti-Cohen theorem. As we shall see later, it is in some 
cases convenient to have in addition a small amount of energy-preserving noise. 
We thus consider: 

= ^ 

■at 

conservative forciug thermostat 
noise 

(1.13) 

• r]j are white, independent noises of variance e, unrelated to temperature, 
since the energy is fixed. 

• 9ij — ^ij ~ is the projector onto the space tangential to the energy 
surface. 

• Multiplying the first of (|1.13l) by , the second by ^ and adding, one 

concludes that energy is conserved provided j{t) = 

The product gij {p)r]j is rather ill-defined because both and rjj are 
discontinuous functions of time. The ambiguity is raised by discretising 
time (jRisken 1984[) . or by specifying the evolution of probability, as we now 
do. 

Repeating the steps leading to the Fokkcr-Planck and Kramers equation, we 
find that the probability evolves through: 

P{q,p) = -HcPiq.p) (1.14) 

where H is the operator: 

p^ d dV{q) d d d d d 

mdqi dqi dpi dpi dpi dpj dpi 

where the summation convention is assumed. The precise factor ordering in the 
last term is important, and specifies the meaning of Eq. (|1.13p . In the absence 
of driving f = it is easy to check that H annihilates any function that depends 

2 

on the phase-space coordinates only through the energy = -^^^ +V. Hence, 
the noise respects the microcanonical measure, in that case. 



1.1.3 Other spaces. Doi-Peliti variables 

The Hilbert spaces associated with probability distributions of the Fokker-Planck and 
Kramers equations are different, as the former consists of functions of TV-dimensional 
space P(q) and the latter of 2A^-dimensional space -P(q, p). In fact, other spaces appear 
naturally (jPoi 1976| IHilhorstj) when the dynamic variables are not continuous. This 
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does not bring in any new conceptual feature, but it allows to write other stochastic 
problems in a familiar 'quantum' notation. Let us give two examples: 

Bosons 

We consider particles on a lattice, with no exclusion. Denoting number of par- 
ticles in site i, the complete set of configurations is spanned by the space: 

\n) ^ \ni, . . . ,nN) = ®'iLi\ni) , (1.16) 

so that a a probability distribution is written as: 

P= ^ c„i,...,„„|ni, . . . ,nAr) (1-17) 

ni ,. . . ,n7v 

We can write any evolution operator in this space introducing the generators 

ai\ni) ^ - 1) 

a,\0) = (1.18) 

Note that aj and Ui are not mutually Hermitian conjugates. For example, for simple 
diffusion on a onc-dimcnsional lattice reads P = —HP with 

i 

Spins: the Simple Symmetric Exclusion Process 

Boson variables do not lend themselves easily to processes where occupation of sites 
is limited, because particles exclude one another. In those cases, Fermions and Spins, 
which have a finite Hilbert space, appear naturally. To be more definite consider the 
'Simple Symmetric Exclusion Process' on a one-dimensional lattice. This corresponds 
to the stochastic process on the lattice {1, . . . , N} where particles jump to neighbouring 
sites, but with the limitation that each site can accommodate at most one particle. 
Configurations n S {0, 1}^ are then identified with kct states 

|7i) = |7ii,...,nw) =®,jli|n.) , (1.20) 

which specify the occupation number of each site, namely rii € {0, 1}. The bulk evo- 
lution is given by the transition rates 

w{n'+^'\n) = -{n'''+'\HB\n) = (1 - n,)n,+i 

w{n'''+\n) = -{n'''+'\HB\n) =n,il-n,+i) . (1.21) 

where n*'-' is the configuration which is obtained from the configuration n by removing 
a particle in i and adding it in j. 

We introduce the operators S, which act as 

S+\n,) = {l-n,)\n, + l) 
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S~\ni) = ni\n^ - 1) 
1 
2 



5°l^.> = ( - i ) l'^.) • (1-22) 



and satisfy the SU(2) algebra 



[5,-,^+] --25,". (1.23) 



Note again that, in this representation, are not mutually Hermitian conjugates. 

In terms of these, the evolution of the SSEP is generated by a spin one-half ferro- 
magnet (jSchiitz et al. 1994^ 

-Hb^Y. + ^rSii + 2S^S^+, - j , (1.24) 



1.2 Hilbert Space 

We shall throughout these lectures use the bracket notation, following Kadanoff and 
Swift (jKadanoff et al 1986P Doing this, wc uncover the similarities and differences 
between stochastic and quantum dynamics, and allows us to import many techniques 
developed in quantum many-body and field theory. Here we do everything for the 
Fokker-Planck case, the generalisation to other dynamics is straightforward. We define, 
as usual, the q representation: 

P(q) - (qlV-) (1.25) 

The evolution equation becomes: 

^^\i;)=-HFPm (1.26) 

whose solution is 

1^(0) =e-'^--|^o) - P(q,t) = (q|e-*^--|^„) (1.27) 

The transition probability is given by the matrix element: 

P(q, t ; qo, to) = (q|e-<*-*°)^-- |qo) (1-28) 

As in quantum mechanics, it is useful to consider the spectrum of Hpp- Because 
Hpp is not self-adjoint, we have to distinguish right and left eigenvectors: 

i?^^p|^f ) = Aal^f) ; {^^\Hpp = \a{i^^,\ (1.29) 

The resolution of the identity is: 
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Defining tlie (in general, imnormalisable) 'flat' state (— | as a constant over all 
configurations: 

(-|q) = l (1.31) 
and using the fact that conservation of probability implies that, at all times 

1= J2 (qiV'(i)> = (-i^w) -> j^{^\m) = -{-\HFP\m)^o ym 

all configurations 

(1.32) 

and the fact that a stationary state satisfies 

^\stat) = -HFp\stat) = (1.33) 

we have that 'flat' and stationary states are the left and right zero-eigenvalue eigen- 
vectors 

{-\Hfp^O ; HFp\stat)^0 (1.34) 

Finally, writing 

l^o) = ^ l^f ) (1.35) 

a 

we have 

P(q, t) = (q|V'(t)) - 5] (qlV^f )e-*^" (1.36) 

a 

Equation (|1.36p already shows us that Xa cannot be negative, and that the long-time 
properties of the probabilities are encoded in the eigenvectors with small eigenvalues. 
In particular, if all eigenvalues have real parts larger than zero, there is no stationary 
state': this happens typically if the system is unbounded. 

In the Kramers case, all of what we have said applies, provided one considers 
functions: 

^(q,p) = (q,p|^) (1.37) 



Perron-Probenius theorem: P is a probability distribution, so it has to be 
positive everywhere at all times. If one Xa has a negative real part, it dominates 
the sum (|1.36p for large times, its coefficient going to infinity. Because (— IV'a) = 
0, necessarily (qli/'a) takes positive and negative values, and this will make P{t) 
at large times not everywhere positive, contrary to the assumption. 



1.2.1 Correlations and responses 

Correlation functions are averages over many realizations of the process, with different 
realisations of the random noise each time. They can be expressed in this notation as: 

CABit,t') = {A{t)B{t'))reaUsaUon = HA B e-''"\init) (1.38) 

here H may be the Fokker-Planck, Kramers, or in general any Doi-Peliti operator. 
Similarly for the response functions 
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RAB{t,t') = 



5hB{t') 



{-\Ae 



-(t-t')H 



dH 



-t'H\- 



it) 



(1.39) 



where the average (A) is over noise reahsations. Eigenvalues with small real parts have 
something to say about the decay of long-time correlations: 

C{t,t') = {A{t)A{t')) - (-|Ae-(*-*')^A|zmt) = |A|zmt)e-^"* 

a 

- {-\A\stat){-\A\init) + (-|A|^f |A|imt)e-^i* (1.40) 

which decays to the asymptotic value as an exponential of RcAi, the first eigenvector 
with non-zero real part. The existence of gap between lowest and first eigenvalue leads 
to exponential decays of correlations. 

1.2.2 Conserved quantities 

Consider a probability distribution that is concentrated on an energy shell, for example 
in phase space ^^(q, p) — Eg- This can be expressed as an eigenvalue equation: 



(1.41) 



Consider further an evolution that conserves energy, generated by some H. By as- 
sumption, there are no transitions between shells of different energies, so the matrix 
of H is of block form, each one corresponding to a value of Eg- This in turn implies 
that E commutes with the operator H: 



[E, H]=0 



(1.42) 



Clearly, what we have said applies to any conserved quantity. 

The fact that the stochastic process can be now broken into subspaces of fixed 
energy entails the existence for each Eq of a different stationary state \statE^), and 
the corresponding flat measure (— £;„|. 



1.2.3 Analogy with quantum mechanics 

Making the identification: 



we may write 



T 



d_ 
dqi 



(1.43) 



H = THfp = - T— 



(1.44) 



and the evolution is 
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(1.45) 



The situation is analogous to quantum mechanics, with T playing the role of h and 
the the role of 'momentum operators'. The quantum-like Hamiltonian comes from 
a 'classical' Hamiltonian 

. dV ' 



H 



(1.46) 



in a phase space with twice the number of variables. This 'classical' problem with 
double number of degrees of freedom will play a key role in the next sections. 



1.3 Functional approach 

Let us first consider a general and extremely powerful trick to convert an algebraic 
problem of finding the roots of a system of equations Qa{^) = into a statistical 
problem of calculating a partition function. We compute the sum over roots of a 
function A: 

(^W>^E^WIrootsofQ. (1-47) 

This can be written 

(A(x)) = / dx A(x) (5(Qa(x)) det 



(1.48) 

The determinant is there to insure that we count each root with a unit weight. We 
have thus obtained an expression that has the form of a partition function. 

The Martin-Siggia-Rose/DeDominicis-Janssen ([Martin et al. 1973| approach con- 
sists of applying this technique to differential, rather than ordinary equations, in this 
case the Langcvin equation. To make this properly, one should discretise time and 
write a delta function imposing the equation p.4p at each time. One obtains a sum 
over paths: 



dxh 



(x) 



dx 



dx A(x) det 



dxh 



paths with 

appropriate b.c's 



f unctional delta 



D[q] D[y] S 



dV 



e 



path probability 



(1.49) 



Where for the moment we stay in one dimension, and omit the forcing term, the 
generalisation to several dimensions and f 7^ is straightforward. The determinant is 
unity in the Ito convention p.4p , because the matrix of second derivatives (containing 
the derivative of the equation of motion at time U with respect to q{tj)) is upper 
triangular with ones in the diagonal. 
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Introducing the integral representation of the dcha with a 'hat' variable (one per 
time) : 



+?oo 



^response' field 



P{q,t) ^ I D[q] D[f^] I D[q] J /^t] ^^^q^ 

J J —ioo 

The boundaries are free for q. Integrating the Gaussian noise away, we obtain: 



P{q,t)= / D[q]D[q\ exp / dt 



i 

H 



(1.51) 



Making the change of variables q —* ^ the evolution equation becomes, for many 
degrees of freedom: 



P{q,t) = J D[q,q\ cxp ^ J dt 



q,qi 



-Qi q. 



dV 



I 

H 



(1.52) 



(summation convention). The exponent is multiplied by 1/T, which thus plays the role 
of 1/h. From what we know of the path -integral representation of quantum mechanics, 
we recognise the action associated with the evolution equation (|1.45p . The analogy with 
(imaginary time) quantum mechanics can be carried further in the absence of forcing, 
by making the transformation: 



H 



FP e 



-I3V/2 _ 



1 fdV 



Td^V 
4 9^ 



(1.53) 



We now have an Hh which is truly of the Shrodinger form, albeit with a new potential 

^eff — S \ dq,J 4 'd^ 

All in all, the situation is as in Fig. 11.11 we can go from the Langevin equation 
to the Fokker-Planck equation for the distribution directly, and then construct a path 
integral representation of the evolution using the analogy between Fokker-Planck and 
Shrodinger equations. Alternatively, we can go straight from the Langevin equation to 
the path integral, through the Martin-Siggia-Rose/deDominicis-Janssen construction. 
The latter procedure is much more flexible, and is thus extensively used in the physics 
literature. 

1.3.1 Lagrangian (Onsager-Machlup) form 

As in all phase-space problems where the momenta appear quadratically (in this case 
the 'hat' variables g^), we have the possibility of integrating them away, thus going to 
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PATH- 


INTEGRAL 


MSR /dD-J^^^^' 






\ patli-integral 




\ construction 




EOKKER-PLANCK 


LANGEVIN 




direct derivation 





Fig. 1.1 With tlie MSR/dDJ one can go straight from stochastic equation to the path-inte- 
gral. Otherwise, given the equation of motion for the probability distribution one can express 
it as a path integral as in quantum mechanics textbooks. 



the Lagrangian representation (Onsager et al. 1953). Alternatively, we can go straight 
to this representation by integrating p.49p over the noise before introducing the 'hat' 
variables. The result is: 



Piq,t)^ J D[q] exp -^J dt 



dV 



(1.54) 



Note that we have inherited the Ito convention from (|1.4p . which means a definite 
prescription on how to discretise the time integral. This sum takes the form of a 
partition function at temperature ~ T . This means we can in principle compute it using 
well-established Monte Carlo methods: a nonequilibrium problem for the configurations 
becomes an 'equilibrium ' problem in trajectory- space. 

In the absence of forcing f = we can develop the square in the exponent, and 
recognising that the double product is a total derivative, we have: 



1i 



dV 

dqi 



- 2T 



[dqf 



,-2T[^(q)-^(qo)] 



... (^-^^^ 

The second derivative in the exponent comes from the correct discretisation (see 
box). We recognise the action of a polymer - the monomer index being the time 
- at temperature T in a potential cx |VFp. This analogy can be exploited fruit- 
fully (|Chandler et al 1981| . 
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Conventions: 

Suppose wc had started from p.53p and constructed the Lagrangian path inte- 
gral as in quantum mechanics textbooks. We would have obtained (|1.55p . the 
exponentials of the potential at the ends of the trajectory being just the change 
of basis leading to (|1.53p . The extra term — ^ / dt in the exponent appears 

naturally. In our case, it appeared as part of the integral J dt qi (^f|") j once we 
specify what this means in terms of a discrete sum. 

The question of discretisation conventions must be treated with care, as they 
may lead to errors in the results. However, one should not exaggerate the 
physical importance of the whole problem, as it dissappears as soon as the 
system is regularised by including inertia. 



1.3.2 Kramers equation 

All the steps above can be performed for a system with inertia. A somewhat confusing 
fact is that our extended space consists of AN coordinates, including the 27V original 
coordinates and momenta (q, p) playing the role of coordinates of the extended space, 
and the 2N 'hat' variables (q, p), playing the role of momenta in the extended space, 
associated now with the operators: 



2 

Time-reversal and Equilibrium 



As we have seen in the previous section, stochastic dynamics lead to an evolution that 
is quite close to a generic quantum problem, often with a non-Hermitian Hamiltonian. 
Such generality is in a way bad news, since one can hardly expect to find a method 
to solve a problem that is so general. A special and important class of that of systems 
that evolve in contact with an equilibrium thermal bath, and are under the action of 
conservative forces. There is then a symmetry that can be interpreted as time-reversal, 
which has important consequences. We shall also see that if this symmetry is broken 
by forces that do work, the symmetry-breaking term can be interpreted as an entropy 
production rate: this will be the basis of the Nonequilibrium Theorems of section 6. 

2.1 Detailed Balance 

The detailed balance property is a relation between the probabilities of going from a 
configuration a to a configuration b and vice-versa: 

e-'^^('^)p,^, = e-'^^Wn^ (2.1) 

The name 'detailed' comes from the fact that if we only ask for the Gibbs-Boltzmann 
distribution to be stationary, we need only that (|2.ip holds added over configurations, 
and not term by term. We can telescope to obtain for a chain of configurations: 

p p p — p-0lV{a^)~'V{ai)] p p p (9 9\ 

which means that 

Probability [path]= e Probability [reversed path] 

And, in particular, for all closed circuits: 

Probability [circuit] = Probability [reversed circuit] 

In other words, we have the Onsager-Machlup reversibility: 

• The probability of any path going from a to & is equal to the probability the 
time-reversed path, times a constant that only depends on the endpoints a, b. 

• Hence, if for some reason there is essentially one type of path that takes from a 
to 6, then there is also essentially one path that takes from b to a, and it is its 
time-reversed. 

These properties are directly observable experimentally, see (jAndrieux et al. 2007p . 
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2.1.1 Fokker-Planck 

Detailed balance holds for the Fokkcr Planck evolution without forcing fi ~ 0. Let us 
see the implication this has: 

(q'|e-*^^^|q) g-^^^i) - (q|e"*''^^ |q'> e-^^^^') 

(q'le-'^^^e-'^^lq) = (q|e-*^^^e-'^^|q') = (q^-'^'^e"*^^^ |q) (2.3) 

Since this is true V(q, q'), we have: 

^ g-ti/j., ^ e'^^iffpe-'^^ = iJ^p (2.4) 

which in turn means that: 

= eP^'/^Hppe-P'''^ (2.5) 

is Hermitian, a fact that we have already checked (cfr p.5p ). Equation (|2.4p gives also 
a direct relation between right and left eigenvectors: 

l^«>=e^^|^f) (2.6) 

We now understand why an Hermitian form cannot be obtained when there is forcing: 
detailed balance is then lost. 

2.1.2 Kramers 

Let us now see how this generalises to a process with inertia, having an energy 7Y = 

2 

2^ + In this case, something like detailed balance holds, but on the condition 

that we reverse the velocities: 

e-^^'^-^Pa^, = e-'^^Wp,^, (2.7) 

where a, b are the configurations a, b with the velocities reversed. Also in the Kramers 
case we can telescope (|2.7p to obtain relations for trajectories, and for closed circuits, 
as we did above for the Fokkcr-Planck evolution. 
In operator notation, Eq. (|2.7p reads: 

(q',p'|e-*^-|q,p) e-^^C^-P) = (q, -p|e-*«- |q', -p') e'^^^^'^ (2.8) 
which leads to: 

He''^ Hk e'^'^U-^ = h]^ (2.9) 

where we have introduced the operator that reverses velocities HpH^^ = — p, and 
similarly with derivatives. Hk cannot, in general, be taken to an Hermitian form. 
Applying to (|2.9p steps analogous to those leading from (|2.4p to (|2.5p one finds that 
Hcrmiticity is broken because H^/^ is not real. 
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Another form of time reversal: the adjoint 

A driven overdamped system admits a form of time-reversal (jBertini et al. 20011 
IChetrite et al. 2008P that is not. however, a symmetry. Consider the Fokker-Planck 
operator with nonconservative forces, and assume we know its stationary distri- 
bution: 

d f d \ 
H\stat) ^ -— T— + fi \stat) = (2.10) 



Put {q\stat) = (f>{q) and compute: 



d_ 

dqi 



2T 



d\nc 



H. 



adj 



Where we have defined the adjoint 



d f dim 
T— - { 2T- 



(2.11) 



(2.12) 



This describes another diffusion problem at temperature T in a new force field: 



2T- 



dhi( 



dqt 



(2.13) 



which only coincides with the original one when f derives from a potential. This 
formula is the basis of the Hatano-Sasa formula (|Hatano et al 200T|) . 
A similar but stronger form of time-reversibility arises if we accept that some 
variables change signs, as velocities do. We refer the reader to Ref. (|Graham 1980p . 
Note that in all these cases, we need to know the stationary distribution a priori, 
so the formulas are moderately useful in practice. 



2.2 Equilibrium theorems: Reciprocity and Fluctuation-Dissipation 

Detailed balance is a form of time reversal symmetry in the trajectories. It cannot come 
as a surprise that in equilibrium it implies time-reversal symmetry in the correlation 
functions. Let us do it for the Kramers equation. Denoting \GB) the Gibbs-Boltzmann 
distribution: 

CABit - t') = {-\A e-(*-*''-f^^- B\GB) = {GB\B .4|-) 
= {-\e-P^B e-(*-*')<A ef'^\GB) 

= {-\ Ile-P^BeP^IV-\ e-(*-*')^- ne^ef^^|Gi?) 



CsAit-t') 



(2.14) 



where we have used (|2.9p and we have defined A(q, p) = ^(q, — p)- The same derivation 
can be done for the Fokker-Planck case for observables dependent on coordinates only. 

Another important equilibrium property is the Fluctuation-Dissipation theorem. It 
relates the response of the expectation of an observable A produced by a kick given by 
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field conjugate to an observable B to the corresponding two-time correlation. Putting 
T^hs —'H — HbB, we compute: 



S{A{t)) _ , , , ^-it-t')H dH_ 



RAB{t - t') = ' = (-1^ e 



5hB{t') ' ' dhi 



|GB) (2.15) 

hB=o 



Because the equilibrium distribution in the presence of a field \GB {Iib)) satisfies for 
all Kb that Hh^lGB (hB)) = we have: 



dH 
dliB 



\GB) 



= (ydllB^^ ^ 



= PH {B-{B)) \GB)h,=o 

3=0 

l3HB\GB)hB=o (2.16) 



and substituting in (|2.15p : 



RAB(t - t') = f3{-\A e-(*-*')^ H B \GB) 

^fi^CABit-t') (2.17) 

2.2.1 FDT of the first and second kind. A derivation of the Langevin 
equation. 

At the outset we started with a bath that contained friction proportional to 7 and 
noise whose variance is jT. This very precise relation between noise and friction is 
often refereed to as 'Fluctuation Dissipation of the first kind', because it relates the 
dissipation and fluctuations of the bath, rather than of the system. If the bath satisfies 
this relation, a system in contact with it will eventually equilibrate (this might take 
long) and will then verify the fluctuation dissipation theorem ()2.17|) - of the 'second 
kind' - for its observables. 

Next, assume there is a large number a ~ 1, Af of independent copies of such 
systems, with coordinates q",p", all in equilibrium with a bath. We intend to use 
them in turn as baths for a further system of coordinates q', p' and energy 7i'(q', p'). 
To do this we couple them, for example through a term: 

n = +7i'(q',p') -M-^ ^q".q' (2.18) 



system 



bath coupling 

We may ask what is the condition for the coupling term to constitute a legitimate 
thermal bath for the primed system. The equations of motion of the primed variables 
are: 

q^-^-m (2.19) 

Where the field h is h = M~i J^ala- The large M limit allows us to treat each 
M~iq".q' as a small perturbation to the system Ha, and to invoke the central limit 
theorem to say that M~ 2 q" is a Gaussian. Assuming the expectation values of {qa) — 
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in the absence of coupling, the field h has two contributions for large Af : i) a, random 
Gaussian noise r]{t) with correlation Caa{t,t') = {T]{t)T]{t')) ~ {qa{t)qait')) and ii) a 
drift due to the back effect of the q' which acts as a field on the qa- Again, because M 
is large, the average response of the ensemble a is: 

M-i{q'')^ f dt' R^^{t,t')q'{t') (2.20) 
Jo 

The equation of motion of the primed variable becomes: 

q^~^+rm+ dt' R^o.{t,t')q'{t') (2.21) 
oq Jo 

This is in fact the generalised Langevin equation, describing the primed system in 
contact with a thermal bath with coloured noise and friction with memory 0. The 
condition that the bath is a good equilibrium one is precisely: 

Ti?,„((t,t') = ^a„(i,i') (2.22) 

The primed system will, under the action of this dynamics, equilibrate to the Gibbs- 
Boltzmann distribution q 

• Fluctuation dissipation of the second kind for the qa has become a fiuctuation 
dissipation of the first kind when they are considered as a bath for the primed 
variable q' . 

• Friction is the back reaction of the bath to the perturbation exerted by the sys- 
tem, while noise is given by the incoherent addition of motions of the bath's 
constituents. 

• The fluctuation-dissipation relation regulates the balance between these two ef- 
fects in such a way as to guarantee that equilibrium is 'passed on' to a coupled 
system. 



2.3 Time- reversal violations and Entropy production 

In the previous section we have seen that there is a symmetry related to time-reversal 
in systems that are in contact with an equilibrium thermal bath, and have only con- 
servative forces. Once forcing is allowed, this symmetry is broken: in an interesting 
manner. As we shall now see, the symmetry-breaking term turns out to be propor- 
tional to the power injected by the nonconscrvativc forces, divided by a temperature: 
it can be interpreted as an entropy production 

^The white noise limit 11.11 1 is obtained when Caa{t ~ t') oc d(t — t'). 

^With an energy that includes a contribution {[q"]^)q ^ coming from the interaction term (The 
factor ([q"]^) is the equilibrium expectation for a single isolated Ha). This term can compensated by 
an opposite one in 7i' 

^Subtle questions about when can one call this entropy arc extensively addressed in the litera- 
ture ijMaes 2003|l . here we shall not get into these matters 
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For simplicity, we shall do this calculation in two cases with inertia, in order to 
avoid unnecessary complications brought about by the fact mentioned above that the 
power in an overdamped case is a discontinuous function of time. 

Kramers equation 

Let us attempt to obtain a relation like (|2.9p in the presence of forcing. Referring 
to (frgj) Hk = HLtouviiie + Hb + Hf we compute Ue^^ Hxe'^^U-^ in detail. First, 
we make the similarity transformation: 



Liouville 

d Pi\ d 



„I3H TT _ TT 

^ ^Liouville^ 

\ opi m J dp, 
ef^^Hfe-f'^^-fJ^-fA (2.23) 



Next, Hermitian conjugation changes signs of derivatives, and reverses the order of 

d 

dpi 



factors. Finally, velocity reversal transforms {pi —Pi) and ~W~- ^^"^ 



get: 



(here again, Vi = Pi/m is the velocity). As announced, we find that relation 
has now an extra term corresponding to power divided by temperature, an entropy 
production rate. 

Gaussian Thermostat 

We may repeat the calculation for the Gaussian thermostat (|1.15p . Only velocity 
reversal is necessary, and we obtain: 

[uHgH-']' = Hg + te]^ - ^ [IP.] 

= Hg-{N-1)^-^ (2.25) 

(summation convention) where 2N is the dimension of phase-space. Again, this has 
the interpretation of a power divided by a kinetic temperature (~ p^). 



2.4 What does a stationary out of equilibrium distribution look 
like? 

A driven system, if coupled to a thermostat will reach a stationary distribution. The 
problem with out of equilibrium statistical mechanics is that there is no simple, general 
expression for this distribution. If the thermostat keeps the energy constant, as the 
Gaussian one we defined above, even if the probability is then restricted to an energy 
shell, it does not cover it in a uniform, microcanonical way. In fact, if the thermostat is 
deterministic, the distribution will be a fractal. If, on the other hand, the thermostat 
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involves noise, the fractal will be blurred, but the distribution is still non-uniform 
on the energy shell. Can we get some intuition on this? The purpose of this section 
is to see, in a relatively simple example (jBonetto et al. 1997al IKurchan 2007^ . what 
happens when a system is forced. 

We shall consider a Lorentz gas, or, equivalently, a particle in a billiard as in Figure 
12.11 under the effect of a constant field, with periodic boundary conditions. We assume 
that there is a Gaussian thermostat, which fixes the velocity modulus to be constant, 
that we can take as unity. The trajectories between bounces are given by: 




Fig. 2.1 Left: a trajectory at zero field. Right: trajectory under a very strong field, pointing 
in tlie downward direction: the particfe foflows, through short bounces, the surface of the 
obstacle, and escapes when the field becomes tangent to it. 
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Px = E 



Py = 



Epi 

EpxPy 
p2 



= -Esine = - — [-£; cos 6*] 
du 



(2.26) 



where we have defined the angle {px,Py) = p(sm 6*, cos 6*) (Figure [2?2|) If there were no 




Fig. 2.2 Free trajectory under a field. Attractor A and repellor A are parallel and antiparallel 
to the field. 

obstacles, there would be two stationary situations: when the velocity is parallel to 
the field (stable) and when it is antiparallel (unstable). Such trajectories constitute 
the attractor and the repellor respectively. 

Consider now the effect of obstacles. When the field is off, in the presence of 
obstacles the system is known (jSinai 1963P to be ergodic. Phase-space points on the 
energy shell - the Cartesian product of the allowed configurations times the velocity 
sphere, are visited uniformly. On the other extreme, if the field is very strong, the 
trajectories stay bouncing close to the surface of the obstacles until they escape along 
a tangent direction parallel to the field, only to hit a new obstacle - see the right panel 
of Figure [2111 If we consider a stationary situation, these 'trickling down' trajectories 
involve only a very restricted part of configuration space. Adding energy-conserving 
noise to (|2.26p does not change dramatically the situation. In conclusion, the stronger 
the forcing field E, the more focussed the attractor reached at long times is on a subset 
of the energy shell. As we shall see later, the Gallavotti-Cohen fluctuation relation is, 
in a certain sense, a measure of this focussing. 
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2.5 Spectra of Fokker-Planck, Kramers and Liouville operators 

As we have seen above, the detailed balance property implies that the Fokker-Planck 
operator can be taken to an Hermitian form via the similarity transformation ()2.5p . 
This implies that its eigenvalues are real. If, on the other hand, detailed balance is 
violated, then there is no such transformation, and some eigenvalues come in complex 
conjugate pairs. In any case, the Perron-Frobcnius theorem mentioned above implies 
that the real parts of eigenvalues are non-negative. 

On the other extreme, the Liouville operator corresponding to pure Hamiltonian 
dynamics 



_dHd__dn^ 

dpi dqi dqi dpi 



(summation convention) would seem to be, at least superficially, anii-Hermitian, since 
it has only first derivatives. Its spectrum would be pure imaginary, except that we have 
to be careful when we define the space of wavcfuncions on which it acts. Clearly, the 
Kramers operator, which in a sense interpolates between both, is neither Hermitian 
nor anti-Hermitian, and has pairs of complex eigenvalues even in the conservative case, 
as we have mentioned already. 

Consider a strongly chaotic, 'mixing' Hamiltonian system. If we start from an initial 
configuration distributed in a small probability cloud, the probability distribution 
mixes completely in phase-space - like a drop of ink in a stirred liquid, hence the name. 
Another way of obtaining the same result is to start from a single configuration, but 
subject the dynamics to a small noise. The fast mixing of the probability implies a 
concomitant fast decay of correlation functions to their stationary value. As discussed 
above (see Eq. p.40[) ). this would imply that there is a gap in the real part of the 
spectrum. Hence, if the deterministic system is sufficiently chaotic, we expect that, in 
the presence of a small amount of noise, its spectrum will have a gap in the real part 
of the lowest eigenvalues. When the noise is strictly zero the situation is more subtle, 
as there are many unstable periodic orbits, on which the correlations are of course 
periodic in time. 

A very interesting question is then what happens when we start from a Hamiltonian 
system with stochastic noise, and gradually decrease the noise's intensity. This can be 
done with the Kramers equation without forcing, or, better, with the energy-conserving 
thermostat (|1.15p . letting e ^ 0. The remarkable result, consistent with the discussion 
above, is that, if the system is sufficiently chaotic, as we let e ^ some eigenvalues 
do not become imaginary: they retain a positive real part. These eigenvalues, and the 
corresponding eigenvectors, are the Ruelle-PoUicott resonances (jRuelle 1986P : the ones 
that have smaller real part are responsible for the long time relaxation to equilibrium. 
A simple example of this phenomenon is discussed in the box below. 



Spectra of Fokker-Planck, Kramers and Liouville operators 23 



An instructive example is the partiele in a harmonic potential with noise. The 
diagonalisation of the corresponding Kramers operator is simple, and can be 
found in Risken's book (jRisken 1984p . Chapter 10: The spectrum is as follows. 
The eigenvalues are labelled by ni,n2 = 0, 1, 2, 

• Stable case V = ^uj^q'^: 

Eigenvalues Xni,n2 = ^li^i + '^2) + | y/ 4:Uj'^ — 7^ (rii — n^) The spectrum 
becomes imaginary in the 7 — > limit of zero coupling to the bath, when the 
system becomes an undamped harmonic oscillator. This is compatible with the 
existence of many stable, periodic orbits. 

• Unstable case V = —^oj^q^: 

Eigenvalues Xni,7i2 = (ni+n2 + l) + ^7(ni— 712 — 1) All eigenvalues 

are real and larger than zero. The latter is to be expected, given that there is 
no stationary state. The fact that even in the limit 7^0 the spectrum stays 
real may come as a surprise: it underlines the fact that although the Liouville 
operator seems superficially anti-Hermitian (and would thus lead us to expect 
pure imaginary eigenvalues), in fact the Hilbert space in which it acts makes it 
not be so. Again, we find that the spectrum retains a real part as 7 ^ when 
it is has unstable orbits {q{t) = 0, p{t) = 0, in this case) that are destroyed by 
the any amount of noise (see also Refs. ( [Gaspard et al. 1995 Gaspard 2003[ )). 
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Many systems have processes happening in very different timescales. Often, what is 
interesting is what takes long to happen, while the rapid fluctuations are relatively 
featureless. Consider the following examples: 

• Metastability: Chemical reactions have often metastable states. Consider a mix- 
ture of Oxygen and Hydrogen. It takes a very short time for this gas to become 
'equilibrated' into a mixture O2 + 2i72, staying in a stationary state until the 
reaction O2 + 2iJ2 — > 21120 starts somewhere - an extremely unlikely event - and 
then rapidly propagates throughout. The true equilibrium state, water vapour, is 
then reached. 

Similarly, diamond eventually decays into graphite, but this process is fortunately 
slow. 

• Hydrodynamic limit: Systems with soft modes have slow evolution along these 
modes, while the 'hard' ones relax much faster. The typical example is a liquid, 
whose macroscopic motion is visible and slow, while the density fluctuations at 
the molecular scale evolve fast. As we shall see below, in some cases one can make 
a 'hydrodynamic' description, with the fast fluctuations acting as a noise whose 
intensity goes to zero with the coarse-graining scale. 

• Coarsening and glasses: Suppose one quenches a ferromagnet to a low, but non- 
zero temperature. Domains of positive and negative magnetisation start growing. 
Inside each domain, the system resembles a pure ferromagnetic state with fast 
magnetisation fluctuations. The domain walls, however, evolve slowly, the slower 
the larger the domains. A more subtle case is the one of glasses, which have a fast 
evolution ('cage motion') and slow, collective rearrangements ('aging'). 

In all these cases, our ambition is to concentrate as much as possible in what is slow 
and interesting. In some cases, when we know a priori which are the slow coordinates, 
we may attempt to eliminate the fast fluctuations by allowing them to thermalise, at 
fixed value of the slow coordinates which we then treat adiabatically. One thus obtains 
a 'free-energy landscape' for the slow variables (see e.g. ()E et al. 2002^ ). 

What happens when we do not know exactly who is fast and who is slow? In the 
next sections we introduce a general approach to metastability, first in detail in a 
simple warming-up context, and then mention briefly how it works in general. In the 
last section, we describe the hydrodynamic limit of a transport problem, and how it 
takes us to a low-noise (quasi-deterministic) situation. 
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3.1 Metastability 

3.1.1 The simple case of weak noise 

Consider an ovcrdampcd Langcvin system. We wish to analyse the spectrum of its 
Fokkcr-Planck evolution operator in the weak-noise limit. Because we know that T 
plays a role analogous to h, we shall use what we know from semiclassical Quantum 
Mechanics. Let us first transform Hpp to its Hermitian basis (12.51) : 
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(3.1) 



Consider first a onc-dimensional harmonic potential: 
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We have 
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(3.3) 



Apart from the global factor ^ we recognise the Hamiltonian of a Quantum oscillator 
with oj = \a\/2 , h ~ T and huj = Ta/2. The eigenvalues are then huj{n + i). that is: 



l\ T\a\ 



"+2 



Ta 



0, \a\, 2\a\, 
\a\, 2\a\, . 



if a> 
if a <0 



(3.4) 



As expected, the lowest eigenvalue is zero in the stable, and positive in the unstable 
case. The gap between eigenvalues is proportional to the curvature of the potential. 

We can extend this result to the stationary point of any potential, using the fact 
that at low temperature only the neighborhood of the saddle points contribute. De- 
veloping Veff around a minimum, which we assume is in g = 0, and putting x 
we have 



2 3 

^e// = K'//(o)^ + K;/(o)y+- 
=K';/(o)y +k';/(o)Vt^ +•■■ 

" V ' 

suhdorniiiant 



(3.5) 



So that 

Hh = 2 



Id^ 
2d^ 



\v"m 



j x'^ — —V"{0) + subdominant 



(3.6) 



The eigenvalues are given by p.4p with V"{0) playing the role of a. 
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The generalisation to many dimensions is straightforward. Since one has to develop 
the potential only to second order around a saddle, one can then go to the basis where 
the second derivative matrix is diagonal, and treat each mode as an independent 
oscillator. Every saddle point yields to this order an independent spectrum in the 
Fokker-Planck operator, but only local minima have zero eigenvalues. There is then 
exactly one zero-eigenvalue per minimum. If the calculation is done exactly, one finds 
that the degeneracy is lifted by a small amount inversely proportional to the passage 
times between states, in this case exponentially small in the inverse temperature. If 
there are p local minima, there are then p 'almost zero' eigenvalues Ai,...,Ap with 
the associated escape times A]~^, bounded by the smallest escape time tpass ~ 
min{A];\ A^^. 

Let us now turn to the eigenvectors corresponding to the 'almost zero' eigenvalues. 
The construction we have done above for eigenvalues can be completed to obtain 
(approximate) right and left eigenvectors. Close to a minimum, we expect the right 
eigenvector to be correspond to a Gibbs distribution peaked around it. Similarly, we 
expect the left eigenvector to be essentially a constant. This is in fact the case, but the 
result is stronger, as it holds throughout the basin of attraction of each minimum. The 
situation is depicted in Fig. 13.11 For small temperatures, it is as if an infinitely high, 
thin wall would enclose each basin, within which the eigenvectors and eigenvalues are 
those of the isolated region. Eventually, the finite-temperature corrections which split 
the degeneracy of 'almost zero' eigenvalues, also mix the approximate eigenvectors 
localised in each basin. 



3.1.2 General approach to metastability. Spectra, states and committor 
functions 

It turns out that the low-temperature situation is just an instance of a very general 
scenario. Indeed , one can turn things around and use the existence of a gap in the 
spectrum to give a general and useful definition of metastability (|Gaveau et al 1998{ 



IBovier et al 2000)) (see (Biroli et al (2001) I, for an application to metastability in glasses). 



Consider an evolution operator having the lowest p eigenvalues Ai, Ap whose real 
part is separated by a gap from the others Ap+i, Ap+2, (see Fig l3.2p . There are two 
characteristic times tpass = min {A^^, A~^} and t* = max {X~^i, X~^2y ■■■}'■ they 
will be interpreted as the minimal time needed to escape a metastable state, and to 
equilibrate within a state, respectively. In the previous section t* is of order one and 
tpass is exponentially large in T. At times of order t 3> i*, the dynamics projects 
completely to the space below the gap, but ii t <^ tpass there is still no time for dis- 
cerning between different eigenvectors below the gap. Clearly, the operator exp[— 
for t* t <^ tpass is essentially a projector onto the space 'below the gap' (up to 
terms of order exp[— iAa], with a > p). 

Within the same accuracy, it turns out that one can then find a basis of p right 
eigenvectors \Pa) which are: 

• positive: Paiq) - (q|Pa) > 0: 

• almost stationary: H\Pa) ^ V a = 1, ...,p 

• normalised and not zero in non-overlapping regions of space. 
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Fig. 3.1 Low-temperature spectrum for a multi-valleyed potential V^(top), and the asso- 
ciated effective potential 14// (below). Only the minima of V contribute with near-zero 
eigenvalues. The near-zero (in this case three-dimensional) subspace is spanned by right 
eigenfunctions that are essentially the equilibrium distribution in each basin, and left eigen- 
functions (the committors) that are essentially constant within each basin. The places where 
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As a consequence the \Pa) vectors have all the good properties of metastable states: 
they are positive normalised distributions, non zero only on different regions of the 
configuration space and are stationary on time scales less than tpass- The last property 
is related to the fact that one can also find a basis of p almost-stationary {{Qa\H ~ 0) 
left eigenvectors {Qa\- They satisfy the approximate orthogonality and normalisation 
conditions: 

(Qaln) - 5ab (3.7) 

One can also write approximately: 

e-*^^^|P„)(Q,| (3.8) 

a 

Note that neither {Qa\ nor |Pf,) are exact eigenvectors 'below the gap', but linear 
combinations of them. 

The (5a(q) are the committor functions (jBolhuis et al.\ IDu et al)) of the states, 
giving the probability that starting on a certain point q the dynamics (again, in times 
<C < <C tpass) goes to the state a. This can be easily seen as follows: the probability 
of ending in a point q' starting from a point q is, at times of order t* <C t <C tpass 

Probability^ (q'|e-*«|q) ^ ^(q'|P,)(Qa|q) (3.9) 

■i 

If the point q' is well within the state 'a\ then -Pa(q') is large and the other Pi, {b ^ a) 
are small. There is only one non-zero term in this sum, and we conclude that the 
probability to fall in the state 'a' is proportional to Qail)- Each Qa{<l) = {Qa\<l) is 
essentially one within the basin of attraction of the state a , and almost zero everywhere 
else. The places where the Qa(q) ~ ^ are called the transition states. 

Given any observable A, we can calculate its average within the state 'a' as: 

{A)a = {Qa\A\Pa) (3.10) 

Again, the situation described above can be summarised by saying that for times 
t* t <^ tpass everything happens as if there is an infinite wall enclosing each basin 
of attraction. In the proof, as in the simple example of the previous subsection, the 
definition is unavoidably linked to the timescales {t*, tpass)' if one considers really 
infinite times, before any other limit, then the distinction between states vanishes. In 
real life, this separation of timescales might be controlled by a parameter, or it might 
be just a more or less valid approximation. As an example of the former, consider 
an Ising ferromagnet of size L: the longest relaxation within a state is the time for a 
domain to grow to the size of the system t* ^ L^. The longest overall time is the one 
needed to flip global magnetisation, which requires jumping the highest energy barrier 
tpass ~ e'^^ . For large L, there is an ample regime i* <IC t ^ tpass where there are 
two well-defined states. Many examples for which in fact this construction is the most 
interesting have no parameter that controls the separation of tpass and t*: the time 
for relaxation within a metastable state, and the time for escaping it, are different but 
not infinitely so. 
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Fig. 3.2 The general situation in a system with metastability, whatever the origin of 
timescale separation. The states \Pa} that can be interpreted as metastable probability dis- 
tributions are linear combinations of the right eigenvectors 'below the gap'. 

3.2 Transition currents 

Given a state metastable state |P) constructed as above, we can find the probability 
current 'leaking' from it directly. For example, in a Fokker-Planck case putting P(q) — 

f d dV \ 
J,(q) (X (t^^ + + ^^"^^ ^^'^^^ 
Because every metastable state P is a linear combinations of the exact eigenvectors 
ipail) = ("00 I q) with eigenvalue below the gap: 

P(q) = J2 (3.12) 

a=l 

the associated escape currents are (see Fig |3.3p linear combinations of currents obtained 
by acting on each of them 
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[Tdi + V,] 




Fig. 3.3 Spectrum and escape currents. Acting witii [T-^ + ^] on the eigenvectors 'below 
the gap', one obtains a basis for all the interstate currents. 

If the stationary state has no current, then there are only p — 1 independent currents, 
rather than p of them. 

In many physical situations, we have some idea of the reaction path followed by 
the current, but we do not know the intensity of such a current or the reaction rate. 
Let us derive a formula for the rate in terms of an unnormalised current distribution 
3{q). The only assumption we make is that the reaction time is slower then any other 
relaxation (|Tailleur et al. 2004[ ) . Suppose one has the current J escaping a metastable 
state in an overdamped Langevin problem P(q) with only conservative forces : 



J,(q) cx (^T— + — j P(q) with P(q) = c„V^(q) (3.14) 

and J d^q V^(q)V'f (q) = S^s ^ J rf'^q e^^ ^f (q)^f (q) (3.15) 

The last equality in equation p.lSp is deduced from the relation between right and 
left eigenvectors (|2.6p . We first compute: 



/^v-.^^/.",{(.|-.S;).}e«{(.4-.-).] 

= T j d\P (HfpP) = J2 ^o.cl (3.16) 



and similarly: 
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J d\ e^^ (div 3f J d\ {HppP) e/"" (H^pP) = ^ Xld, (3.17) 

a—l 

where we have used the eigenvalue equation and the normahsation (|3.15p . Let us now 
assume for simplicity there is only one metastable state, so that there are p = 2 
eigenvalues 'below the gap'. At large times t* t tpass ~ '^ni': only the first 
non-zero eigenvalue A,„ contributes to the sums, and we get: 

tacUv - A,„ - ^ ^2^2 - T ^ d^q 6^^ (div J)^ ' ^"^''^^ 

Note that the normalisation of the current is irrelevant. This formula is valid on the 
assumption of separation of timescales, irrespective of its cause. 

For the Kramers equation, a similar expression can be obtained in the same way, 
in terms of the reduced current 11.121 A tedious but straightforward calculation 
yields (|Tailleur et al. 2004|) : 

, ,-1 7/ V^P J-'^(q, P) J-'^lq, -p)] 

tacu. ne Tjd^cid^^ ePE divq,p[j-'^(q, p)]divq,p[j-'^(q, -p)] ' ^""-'^^ 

note the sum in the numerator runs on coordinates and not on momenta. This formulas 
arc useful because they do not depend on a global normalisation of the current. 

3.2.1 Arrhenius formula 

The Arrhenius expression for the activated passage at low temperatures case can be 
easily derived from the formula (|3.18p , using the fact that the numerator is dominated 
the neighbourhood of the barrier top, and the numerator by the neighbourhood of 
the bottom of the starting well. Let us show how this works for a one dimensional 
double well as in Fig. 13.41 The current starts in the metastable state, is approximately 
constant, and falls in the stable state. 

The divergence of the current J{q) of a state ^^{q) satisfies 

divJ = Hpp^jiq) = Xrn^iq) (3.20) 

where A™ is the first non-zero eigenvalue. Now, we have seen that if V'(<z) is a metastable 
equilibrium distribution in the departure state, it is proportional there to the Gibbs- 
Boltzmann distribution in that basin, so that: 



divJ (X V(g) oc e-f^y (3.21) 
up to a normalisation. The current in a point q within the metastable basin is then: 

J{q) = r dg'divJ(g') ^ r dq' e"'^^ ~ 6"'^^"- /' dq' e-^^-^^'"/^ (3.22) 



where we have used a Gaussian approximation. Around the barrier, we get: 
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escape current J{q) 



Fig. 3.4 Escape from a metastable state of height Vmin through a barrier of height VMax- 
Below: the current distribution. 

The denominator of (|3.18|) is dominated by the neighbourhood of the minimum: 

1 /2 

/ dq e^^ (div Jf ^ [dq e-'^^ ~ g-'^^-'" (^^) (^.24) 

J basin in / 

The numerator in (jS.lSp becomes, using the fact that J is essentially constant around 
the maximum 



1/2 

dq J ~ J barrier j ^q ^ ( „ J J barrier (3.25) 

J barrier KP^ Max / 



f r / \ 1/2 

J barrier J barrier Max > 

Putting numerator and denominator together, we get: 



T j dq (div J)2 /ly" V". I 

\ I Max rnm I 



which is Arrhenius formula (Hanggi et al 19901, with the good prefactor. Note how 



simple the argument is, once we have p.lSp . 

3.3 Hydrodynamic Limit and Macroscopic Fluctuations 

The Simple Symmetric Exclusion Process of Equations ()1.24p and Figure 13.51 has a 
separation of timescales, this time brought about by a local conservation law (of par- 
ticle number) rather than by barriers. Consider an L-site long, isolated chain, with 
average occupation one half. Suppose now that we make a vacancy of twenty con- 
tiguous unoccupied sites. For this to happen spontaneously is an extremely rare event 
( ~ 2~^°), and left on its own, the vacancy will be covered rapidly, in a time of order 
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one. On the other extreme, consider a slowly varying average density profile, say as a 
sinusoidal oscillation of length L. Such a fluctuation will take a time of order to 
die out. 

This separation of timescales manifests itself in the stochastic evolution operator, 
which has eigenvalues of order 1 for the most steep and for the smoothest spatial 
fluctuations, respectively. 

We may now choose to study only the smooth fluctuations, corresponding to the 



lowest eigenvalues, see (S pohn 1983| IBertini et al. 200 To do this we introduce a 
parametrisation of space |^ Xfc = ^ and rescale the time as t ^ L^t. In the rescaled 
time, steep fluctuations disappear immediately. At the macroscopic level, the density 
profiles we consider are smooth functions and discrete gradients can be replaced by 
continuous ones: 



Pk+i - Pk 



L ' 



pk+i - Pk 



^Pj^k) 
L ' 



L-l 

fe=i 



dx 



(3.27) 



It can be shown that in terms of these variables, the evolution of the smooth density 
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Fig. 3.5 The simple symmetric exclusion process, and its coarse-grained hydrodynamic 
limit. 

fluctuations is given by the stochastic equation ( [Spohn 1983[ ) 



p=-VV + V[v/p(l-p)7?]; P(0) = po; p(l) = Pi (3.28) 

where 77 is a white noise of variance 1/(2 L). This is the formula for the fluctuating 
hydrodynamics of the exclusion process ( [Spohn 1983[ ). 

^ Note that X here is an index labelHng the field p{x). Comparing with the examples in the first 
sections, we should make the correspondence {qi,i) — > (p(x),a;) 
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We can now construct the action with the Martin-Siggia-Rose ([Martin et al. 1973| 
formahsm. Performing the usual steps as in section 1.3, introducing conjugate fields 
p{x, t), wc get: 



This time, L ^ plays the role of temperature - or of h, and we have the 'classical' 
Hamiltonian density: 



The paths are constrained to be Pi{x) and Pf{x) at initial and final times, respectively. 
The values of p are unconstrained, which is in agreement with the fact that this is a 
Hamiltonian problem with two sets of boundary conditions. 

The message of this section is that we may transfer all the 'semiclassical' low-noise 
(small T) techniques to the coarse-grained limit to obtain a 'Macroscopic Fluctuation 
Theory' IjBertini al. 200 1| [Jordan et al. 2"004| . 




(3.29) 




(3.30) 



4 



Large Deviations 



In equilibrium statistical mechanics we are given the probability of being in any par- 
ticular configuration. For a dynamical system, we may wish to ask similar questions 
concerning histories, rather than configurations: what is the probability that the sys- 
tem visits a sequence of configurations at given times, or that during a time-interval 
its average enery has a given value, and so on. Often these events are rare, their proba- 
bility is small. In spite of this they may be important: for example, what is interesting 
in chemistry are reactions that are slow compared to thermal vibrations. In this sec- 
tion we shall study two types of large deviations: those that are rare because they 
are induced by (weak) noise, and those that are rare because they are sustained for 
a long time. Technically, this provides us with two small parameters: noise intensity 
and inverse time-span, respectively. 

4.1 Climbing to unusual heights 

We now study the probability of finding the system in unusual configurations. The 
formalism is essentially the WKB theory for semiclassical quantum mechanics, math- 
ematicians know it as the Freidlin-Wentzell (jFreidlin et al. 1984P formalism. For sim- 
plicity, the discussion in this section will be for the overdamped case, but one can do 
the same for any other stochastic equation. 

In the small-noise limit, whichever its origin (low temperature, hydrodynamic 
limit), the equations of motion are essentially deterministic. If we ask for the prob- 
ability P(qo,to — *■ ^.^t) of the system meeting an appointment at time t in q, given 
that it started in qo at time to we may get two sorts of answers: i) Essentially one if 
a deterministic path precisely passes by the given points at the given times, ii) Ex- 
ponentially small ^ Q-T^'^'i-*^) otherwise: only thanks to the noise the system can get 
out of its deterministic schedule in order meet the appointment (this includes being 
in the right place at the wrong time). In order to calculate probabilities, we go back 
to the path-integral expression (|1.52p 



As we mentioned in section 1, this is an imaginary-time path-integral with the 'clas- 
sical' Hamiltonian 



The path integral is dominated by the extremal trajectories, which satisfy Hamilton's 
equations: 




(4.1) 




(4.2) 
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a - ^ - - (o + ^ + f 

~ dq, ~ ^] \ dq, dq, J 'iJ ^ dqi 

The probability now takes the large deviation form: 

ln{P(qo,<o ^ q,i)} = -^J't = -^Action (4.4) 
which defines the large deviation function J^f(q). The action is the integral 

Action = - dt[Y^ q,q, - H{(i, q)] (4.5) 



with (q, q) solution of 
Noiseless solution. 

A family of solutions of (|4.3p can be easily found: 

dV 

= ; = (4.6) 

The dynamics of the original (hat-less) variables is just the noiseless equation of motion 
(|1.3p . The action is zero, as can be easily checked: this means that the large deviation 
function will be also zero - in fact, its smallest value. 

Other solutions. 

Other solutions of (|4.3p can be found with qi ^ 0. They do not correspond to motion 
in the original force field p.3p . signalling the fact that noise is playing an important 
role. The action is positive, a fact that can be best appreciated in the Lagrangian 
formalism (|1.54p . The large deviation function is now positive, and the probability is 
exponentially suppressed in l/T, again an indication that noise is playing a role. 

Large times 

In most cases, we are interested in the long time limit J^t at large times: 

e-iFi^) = P{q„,to ^ q, t) (4.7) 

t — 'OO 

or, otherwise stated, in the probability of finding a stationary system in a configuration 
q. The corresponding large-time deviation function (which we shall denote simply as 
J-') becomes independent of the initial condition. How can this be? 

Consider first a quadratic potential V = ^q^ in one dimension, as in Fig. 14.11 The 
Hamiltonian is if = —q{q + q). The 'classical' trajectories are given by: 

{ = "'^r (4.8) 

q^ q 

The solution with q(t) = corresponds to the relaxation to the minimum. If we now 
start at time to at, say, q — 1 and ask that at time < > to we be at g = — 1, we have 
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= -<i{g + g) 





Fig. 4.1 The {q,q) space for a Langevin process in a one-dimensional quadratic potential. 
The incoming straight lines indicate the noiseless 'downhill' trajectories, the outgoing straight 
lines the 'uphill' trajectories. Curved trajectories missing the origin are relevant for finite-time 
large deviations. 

to take one of the trajectories with q > ; the one that arrives in q = — 1 at the right 
time. If we now consider very large times t, the solution will be one that passes close 
to the (hyperbolic) point (g = , g = 0), in the vicinity of which it will spend a long 
time. In the limit t ^ oo, the trajectory is the succession of a gradient descent into the 
origin, and an 'uphill' trajectory emerging from the origin. Similarly, the trajectory 
that starts in q = 1 and ends at large times at g = +2 is composed of a gradient 
descent towards the left, followed by an uphill motion to the right - the limit of a 
g < 'bounce'. 

In conclusion, in order to calculate the stationary large-deviation function we have 
to consider the 'downhill' trajectory (the anti-instanton) with g = from the initial to 
the stationary point, followed by an 'uphill' (instanton) trajectory from the stationary 
point to the final point. The large deviation function is the sum of the downhill and 
the uphill actions. As mentioned above, the former is zero: When calculating the 
probability of reaching a configuration at large times, we may consider that we started 
from a stationary point - a physically intuitive result since in a short time at the 
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Fig. 4.2 Trajectories in (g, g) space for a double-well potential. 



beginning the system goes from initial to stationary configuration. 

We need tlie 'uphill', instanton trajectories to calculate, via their action, the prob- 
ability of a rare configuration. For generic dynamics this is hard problem to solve 
analytically, and even numerically. 

4.1.1 Detailed balance and Onsager-Machlup symmetry 

As we saw in section 2, if there is detailed balance, and given that one trajectory 
dominates for the downhill process, we should expect the time-reversed trajectory to 
dominate for the uphill process. Let us check this explicitly for a multidimensional 
system with a potential V and no forcing fi = 0. We propose, as partial solution of 

dV 

* - +^ (4.9) 
Inserting in the first of (|4.3p . this implies qi ~ ^ which, when replaced in the second 



of (gJl) gives: 
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Replacing qi ~ in this equation, and using 



d f dV\ _ ^ d^-v . 

dt [ dqj ~ ^dq,dq^' ^ ^ 



1 



we obtain an identity. We conclude that the time-reversed dependence (|4.9p is indeed 
a solution. We may compute the action of the uphill trajectory, which turns out to 
depend exclusively on the initial and final potentials: 

action = J dt ^^q^ qi + J dt q., (^q, + 

= ^ = ^(q) - ^(qmin) (4.12) 

in accordance with the general situation with detailed balance discussed in Section 2.1 
Up to a multiplicative constant, this implies via (|4.4|) that the stationary probability 
is the Gibbs-Boltzmann weight, as expected. 

On the other hand, repeating the calculation in the case in which there is a generic 
force term fi, we get that the second of (|4.3p is satisfied by the time-reversed trajectory 
if: J2j ~ fj = which is in general not true. Hence, we reach the important 
conclusion that in the presence of forces that do not derive from a potential, relaxations 
into and excursions out of the stationary state are not the time-reversed of one another. 
Furthermore, we have to calculate the action on the basis of the explicit solution, and 
this it is no longer miraculously given exclusively in terms of the initial and final 
configurations. 



4.1.2 The Arrhenius Law again 

In the preceding paragraphs we have assumed that in order to reach the final config- 
uration, only one downhill and one uphill trajectory suffice. This is clearly the case 
when there is only one stable state. In the presence of metastability, the trajectory 
may be a sequence of downhill and uphill segments. 

For example, in the case of a double-well potential, the passage probability (the 
Arrhenius law) can be obtained as the probability of falling onto a stable point, then 
climbing up to an unstable point, and then descending to the next saddle, as in figure 
14.21 The probability of descents being of order unity, we are only left with the uphill 
path. Again, if the system derives from a potential, the probability of climbing depends 
only on the difference in height between valley and saddle, and we recover the Arrhe- 
nius law calculated in previous sections (|3.26p . In the presence of non-conservative 
forces, then the path joining stable and unstable saddles has to be computed, and 
from its action we get the probability. 



^ Note that time reversal applies to the q, and not the q variables 
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In many dimensional energy landscapes, one may wonder if the most proba- 
ble path still goes through saddles with only one unstable direction. A mo- 
ment's thought shows this to be the case, as discussed by Murrel and Laidler 
(jMurrel et al 1968^ 



4.1.3 Low noise in phase-space: Kramers and thermostatted. 



Let us mention briefly how one proceeds in the case of phase-space dynamics with 
noise. One obtains by following the same steps as in the previous paragraph, 'classical' 
equations in an extended space {qi,Pi,(ji,Pi)- There are solutions of these equations 
that have qi = Pi = 0: they correspond to the original noiseless equations in the 
original space and have zero action. The rest of the solutions have non-zero {qi,Pi) 
and positive action. 

In some cases, one may go to a 'Lagrangian' description involving only {qi,Pi), 
but then the equations obtained contain second time-derivatives, a relic of the noise 
in the original phase-space. 



^Note that (q,p) are the 'coordinates', and (q, p) the 'momenta' in the 4A'^-dimensional phase- 
space 
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Periodic orbits, complexities and traces 
Classical Mechanics 

What follows is very sketchy, its only purpose is to stimulate the curiosity of 
the reader, who will find an excellent reference on the subject (|Cvitanovic[) . As 
we have seen in the previous sections, the spectrum of an evolution operator 
contains all the information on the crgodic properties of the system. One way 
to study the spectrum of any operator H is to compute the trace Tre~*^, and 
then obtain the resolvent, which has poles in the eigenvalues A of _ff , through 

V = Tr[A -H]-^ = dt Tt e-*(^-^) (4.13) 

. A - Aa Jq 

The trace on the right hand side is a sum over paths just as seen in the previous 
sections, the only difference is that we are to consider closed trajectories in 
which initial and final configurations coincide. For an evolution taking place 
in phase-space (q, p), it is an integral of probabilities of return after time t 
Tre-*^ - / dq dp P(q, p, i ; q, p, t = 0) . 

As mentioned in section 2, one can study the chaoticity properties of Hamil- 
tonian dynamics by studying the spectrum of its evolution operator in the 
presence of noise, and then letting the noise level go to zero: one thus uncovers 
Ruelle-PoUicott resonances. This poses the problem that noise will make en- 
ergy nonconserved, and generate a slow diffusion in energy. To avoid this, we 
may use the energy-conserving noise of the Gaussian thermostat in section 1 , 
leading to Hq ■ 



^ ^ m Pin, dqi dpi ^ 



d d_ 

dpj ^■'^ dpi 



(4.14) 



cfr. Eq. (ITTSl) with f = 0. 

A trace in the path integral becomes then a sum of periodic orbits on the 
energy shell. In the small-noise limit those that dominate are the ones having 
zero action, and these are just the periodic orbits of the original Hamilton's 
equations. One thus expresses Tre"'^'-' as a sum over orbits of period t. 
The final product is that Equation (|4.13p becomes a relation between the resol- 
vent, containing the information on eigenvalues, and a sum of periodic orbits of 
all periods. The interested reader will find this properly done in (jCvitanovicP . 

Complexity in Statistical Mechanics 



A different application of the same idea (Biroli et al (2001)) is to count the 
number of metastable states. With the assumptions of section 3.1, picking a 
time t intcrmdiate between the time needed to equilibrate within a state t* 
, and the time needed to escape it tpass, the number of states is given by 
■Instates ~ Tr e^*^ with i* <C i ^ tpass- This becomes a sum over all periodic 
trajectories of period t. 
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Sampling transition paths in practice 

A problem of great interest in physics and chemistry is the one of computing 
in practice the escape route (and rate) from a metastable state, for example 
the decay rate of a metastable molecule. In many cases we have a separation 
of timescales between molecular vibration and the time needed for the actual 
decay to occur. In order to compute the probability P{a,t;b,to) of starting 
at to in a configuration a belonging to the metastable state and reaching a 
configuration b belonging to the stable state at time t, we may sum over paths 
going from a to b, with their appropriate weight given by the action. 
Depending on the dynamics, different approaches are more practical. In the case 
of a system that is strongly coupled to a thermal bath, and follows a Langevin 
equation, one can follow the Lagrangian 'polymer' analogy (jl.54[) . and use any 
Monte Carlo simulation method for a system in equilibrium at temperature T 
to sample the paths. 

For a system that is closer to being deterministic, a well-developed technique 
(TPS: 'Transition Path Sampling') (jBolhuis et aL[) uses an algorithm that sam- 
ples trajectories by modifying them slightly at the barrier. This kind of change 
allows to obtain a new path that still has good chances of being a transition 
(going from state a to state b), at least if the system is not too chaotic. 
There is a large body of literature on the subject (see e.g. (jBolhuis et aL\ 
IE et al. 200"2j IMicheletti et al. 2004]) ) since the potential applications in chem- 
istry, biochemistry and physics are huge. 



4.2 Unusual time averages 

In this section we study a different type of large deviation. Instead of asking for the 
probability of the system reaching an unusual place, we ask for the probability that it 
sustains an unusual time-average for an observable during a long interval: 



The most celebrated example is the average power, which can be interpreted in some 
cases as entropy production: 




(4.15) 




(4.16) 



where v = q Another example is the average potential energy 




(4.17) 



Expressing the probability in terms of trajectories: 




(4.18) 
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and writing the delta lunction as an exponential 

rt \ /'+200 



we get: 



sftA- [ dt' A{t') \ = / diJL eK*^-£ ^(*')) (4.19) 
P{A) = /+^ d^i e^*^ j Z?[q] (Prob. Trajectory) e~''/o 



= f dii e^*^ X e"*G(M) 

J —ZOO 



(4.20) 



(4.23) 



which defines the large-deviation function G'(/i). For example, in the Fokker-Planck 
case, it reads: 

g-tG(M) ^ j D[ci,c^ e'T'/'it' [E.9-4>-ifFP-M£ dt'ACt')] (4 21) 

What we have done is nothing but the analogue of a passage from a microcanonical 
calculation of 'entropy' = In^, to a canonical calculation of 'free energy' G(yu)/yU at 
'inverse temperature' ^. The 'space' in our problem is in fact the time, and 'extensive 
quantities' are those that are proportional to time: we extracted a time in the definition 
of G(/i) in order to make it 'intensive'. For large t, in analogy with the thermodynamic 
limit, assuming that G(/i) has a good limit, we may evaluate the integral over /i by 
saddle point, to obtain: 

lnP{A)^t[^l*A~G{^l*)] (4.22) 

with 

d^ 

This is the Legendre transform taking from canonical to microcanonical. Note that 
A{^*) plays the role of EQ3) in ordinary thermodynamic systems. Now. by simple 
comparison, equation (|4.2ip can be brought back to operator language: 

g-tG(A.) ^ (^^„^^|g-t[Hi.P+M(q)]|j„ji) (4.24) 

A similar expression holds for Kramers equation . In the language of the analogy 
with one-dimensional models, this is just the transfer matrix formalism ( [Huang 1987 [ ), 
along the time-dimension. If we now introduce the right and left eigenvectors of H+fj,A: 

[H + M(q)] IV-f (m)) = Km^iiA) ; (l^f (a^)I W + M(q)] = A,(M)(IV'f (m)I 

(4.25) 

we obtain: 

^-tGi,) ^ J2 (fina#«(M))(^f (/i)Nmt) e-*^"('') (4.26) 

a 

and, as t ^ oo: 

Gifl) ^ Xrmnip) (4.27) 

where XminifJ') is the eigenvalue with the smallest real part. Again, this is in complete 
analogy with the expression of free-energy densities in terms of the transfer-matrix 
lowest eigenvalue. 
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Note the fundamental difference between this large-deviation functions and those 
of section in that case by 'large' deviations we meant that they are exponentially 
small in the temperature (or the coarse-graining size) while here we mean that they 
arc sustained for long times, and the only large parameter is precisely the time t. 

4.3 Simulating large deviations (and Quantum Mechanics) 

Large deviations can be measured by evolving the system with its real dynamics, 
and then making a histogram of the deviations obtained. However, equation (|4.24[) 
suggests that we try to simulate directly a system that evolves through Hpp + /lA 
(jGiardina et al 20061 lAldous 1994 } : 

P = -[Hpp + nA]P P(q, t) = e-*[»^^'+^^l (4.28) 

Clearly, as t — > oo the distribution P tends to the eigenvalue with the lowest real part. 

We are dealing with a dynamics without probability conservation. In fact, we can 
reproduce it by using a large number of non-interacting walkers, each performing the 
original (Langevin) dynamics with independent noises, occasionally giving birth to 
another walker starting in the same place, or dying. A negative (positive) value of 
^J.A{^) gives a probability |/iA(q)|c?i of making a clone or of dying, respectively, in a 
time-interval dt. At each time, the global number of clones M(t) changes, in such a way 
that for long times M{t)/M{t = 0) ~ g-AminlM)*^ practice, one can normalise the 
total number periodically by cloning or decimating all walkers with a random factor. 
The factor needed to keep the population constant is, again, the exponential of the 
lowest eigenvalue. 

Notice that imaginary-time Shrodinger equation is precisely of the form ()4.28p . 
with no drift in the Langevin process and iiA the quantum potential. Indeed, the 
method described above was developed for precisely this case, and is called 'Diffusion 
Monte Carlo' (DMC) (|Grimm et aZp . 



5 

Metastability and dynamical phase 
transitions 



In several places above we have pointed out that the stochastic dynamics can be seen as 
a kind of 'thermodynamics in space-time'. Trajectories contribute with a weight given 
by the (Onsager-Machlup) action, much as energy determines the Gibbs-Boltzmann 
weight in thermodynamics. Large deviation theory just consists of biasing the measure 
with an extra weight added to the dynamic action and computing the new sum, which 
then looses the meaning of a transition probability and becomes a large-deviation 
function. 

Systems with non-trivial dynamical properties sometimes show very little in their 
static (time-independent) structure. The typical example is that of glasses, which 
are virtually indistinguishable from liquids from the point of view of organisation of 
the molecules, until one looks at their dynamics, which is dramatically slower. This 
situation has motivated some researchers (jJack et al. 2006| to look into space-time 
thermodynamics - the statistical properties of trajectories - for the missing structure. 
One considers the large deviation theory of systems that are dynamically non-trivial, 
and indeed it turns out that one often finds (jGarrahan 2007^ that there is a rich 
structure of 'dynamic' phase transitions in the large-deviation functions. 

In this section we shall see that space-time transitions are closely related to the 
approach to metastability of section 3, to which they provide useful insights. 



5.1 A simple example 

Let us start by a simple example of a particle in a potential V performing overdamped 
Langevin motion at low temperature T (see Fig. 15. We shall consider the large 
deviation function of energy G'(/i) , associated with the probability of observing an 
average energy V (|4.17|1 over long time-intervals. As we saw in the previous section, 
G(/i) is obtained from the lowest eigenvalue of Hpp — l^V . 



H 



FP 



fJ.V = - 



-T— + - -V" + ^TV 

dx^ 8 4 2 



(5.1) 



(see (jBenzi et al. 1985)) for a similar application). The lowest eigenvalue at low tem- 
peratures is given by the same developments as in section 3.1.1, only that now we have 
to add the extra term proportional to fi in (j5.1|) . At small T each minimum in the 
effective potential 
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Fig. 5.1 Left: the potential V and the related effective potential 14//. Right: energy (hori- 
zontal lines) and the associated large-deviation function G{fi). The system has two first order 
transitions at /i = and at /i = —(14 — Vr)/|Vs 

Veffii^)^^-jr' + ^TV (5.2) 

contributes separately, just as in section 3.1.1. To leading order in T the contribution 

of saddles point of V{q^) at q'' is A fiV{q'') if it is a minimum, 

and A ~ [|^^"(9^)| + ^^(9*)] if it is a maxinmm. The lowest amongst all eigenvalues 

dominates: 

r Ai = ^lVL 

G{fi) = Xmin = min < \r = ^iVr (5.3) 

[\3 = \V's\+tlVs 

The values of V are given by the Legendre transform V{^*) ~ and read: 

(Vl fJ.*> 
Vifi*) ^IVr; ^l* < and ^i* > -\V^'\/iVs - K) (5.4) 

[Vs M*< -\V:'\/iVs~Vr) 

there are two first order phase-transitions, see Fig. 15.11 

Let us pause and analyse this physically. The scenario is typical first order, with 
three homogeneous 'phases' in time, corresponding to the three values of Equation 
(|5.4p . When we condition a long trajectory to having a time-averaged energy 14, 
this is realised by the system by making a 'phase coexistence' of periods tL,ts,tii 
spent in each of the three stationary points, such that t = II + ts + tu and Vot = 
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Vl^l + Vsts + VrIr. This is strictly analogous to the ice/water coexistence when the 
total energy is fixed. An important lesson is that if the system is conditioned to having 
a value of an observable intermediate between that which it takes in two metastable 
phases, it prefers to achieve this by spending some time in each state, rather than all 
the time in an intermediate situation. 

5.2 Spectral properties and phase transitions 

One can in fact show that the situation we have seen above is very general. In par- 
ticular, it is quite normal that we should have a first order transition at ^ = 0. To 
see this, let us use the formalism of section 3. Consider the operator H. If there are 
p independent long-lived states, with eigenvalues Aa < ipass ~ 0: ^^'^ ^ time t* to 
thermalise inside a state, we can construct a basis of p right and left eigenvectors Pa, 
Qa, with {Qa\Pb) ~ ^ab having essentially zero eigenvalue. 

Let us now calculate the eigenvalues of iJ -I- fiA, for small /i, but still /i ^ tpass- 
We may use (non-Hermitian) first order perturbation theory, to get: 

Aa - Xail^ = 0) + l-l{Qa\A\Pa) - M(^)m state a (5.5) 

In other words, the quasidegenerate eigenvalues split proportionally to the expectation 
value of the observable A in each state, and the phase that dominates is 

G^= min {{A)a} (5.6) 
states a 

Remarkably, the distribution function has in fact pinned down a 'pure state' Pa, Qa-, 
using the observable A. When the sign of fi is reversed, the 'minimum' in equation 
()5.6p is transformed into a 'maximum' and the selected state is changes. There is hence 
always a first order phase transition. Playing with a different observable, we may make 
a different transition that selects any state. Hence, we conclude that the dynamic phase 
transition approach is in fact equivalent to the metastability one of that of section 3, 
but it gives us new tools and a practical perspectives. 



6 

Fluctuation Theorems and Jarzynski 
equality 



Nonequilibrium work relations, the Fluctuation Theorem IjEvans et al. 19931 lEvans et al 19941 
IGallavotti et al. 1995p . and Jarzynski's ( [Jarzynski 1997[ ) equality, are very general re- 
sults valid for strongly out of equilibrium systems. They concern the large deviations 
of work. As such they are closely related to - and enrich our perspective of - the 
second Law of thermodynamics. The two subjects are quite similar, and in fact may 
in some cases be encompassed into a single, more general result: Crooks's equality 
(jCrooks 1998p . which we shall not review here. These results are very recent - sur- 
prisingly so, given their technical simplicity, and have received in the last fifteen years 
enormous attention. They are both based on the relation (section 2.3) between time- 
reversal symmetry breaking on one side, and work and entropy on the other. 



6.1 The fluctuation theorem(s) 

We shall consider here only Langevin processes with inertia, and the Kramers equation. 
This makes the discussion simpler, because of the fact mentioned in previous sections 
that power, being a product of force times a velocity, is only a continuous function of 
time when there is inertia. 

We have seen in section 2.3 that the 'time-reversal' symmetry becomes, in the 
presence of forcing: 

[ne^«i?Ke-''^n"i]^ = h], + ^ (6.1) 

The violation of the symmetry is proportional to a quantity that has the form of an 
entropy production (see ()4.16p and (|2.24p ): 

power ^ 

' T tTJo ^ ' 

Equation (|6.ip is the basis for the results we shall discuss. In fact, there are several 
variants of time reversal, and each gives different identities (jChetrite et al. 2008]) . 
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General: the implications of an explicitly broken symmetry in statis- 
tical mechanics. 

The fluctuation theorem makes use of the exphcit breaking of a discrete symme- 
try, the detailed balance relation. In fact, whenever we have a system composed 
of a part that is symmetric under a transformation, plus an anti-symmetric 
perturbation, we can derive a relation for the large deviation of the perturba- 
tions. Consider for example the statistical mechanics of a system with variables 
si,...,S]\[ and energy Eg, having the discrete symmetry Eo(s) = Eo{—s). This 
symmetry implies the vanishing of all odd correlation spin functions. Now, let 
us perturb the energy with a field E{s) = Eo(s) — ■|A/(s), conjugate to a term 
A/(s) = with M{—s) = — M(s), so that now 

E{-s) = E{s) + hM{s) (6.3) 

Can we conclude something in the presence of /i 7^ 0, when the symmetry is 
explicitly broken? Indeed, we can: consider the distribution of the symmetry- 
breaking term 

P [M(s) = -M] = J dsS [M(s) + M] e-^(^°-t^^) (6.4) 
Changing variables s — > — s, and using the symmetry of Eq'- 
P [M(s) = -M] = J dsS [-M(s) + M] e-'='(-^°+T?7) = e-^^'^Mp j^^^^^ ^ 

(6.5) 

or 

P[M{s)^M] _ 



e'"''" f6 6) 

P [M(s) = -A/] ^ ' 

One wonders if this elementary property has ever been used in other fields 
of physics before the fluctuation theorem. On the other hand, a derivation of 
the Fluctuation theorem that makes close contact with this thermodynamic 
property has been given by Narayan and Dhar (| Narayan et al. 2004[ ). 



The fluctuation theorem is a statement about the distribution P{<Jt) of the average 
quantity (j6.2p . when the experimental protocol is repeated many times. It reads: 



e 



tat 



(6.7) 



A relation like this was first proposed by Evans, Cohen and Morriss (lEvans et al. 1993)) . 

The second Law of Thermodynamics states that the work done on a system over 
long times must be positive. Equation (j6.7p is then a statement about the 'violations' 
of the Second Law, when the average work has the opposite sign 0. The factor t in 



^The quotation marks are just to remember that since the Second Law appHes to the Umit of long 
times tN — > 00 for a single-instance experiment, these are no true violations. 
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the exponent to a certain extent quantifies the supression of the probabihty of such 
processes when the time is large, thus giving a better perspective of the Second Law. 

Equation (j6.7p can be reexpressed multiplying by e"*'^'^' and integrating over at 
as a property of the large-deviation function G{fi) = 7 / ddt P(crt)e~^'^'* (|4.20|4.2ip : 



The resuk ((6J))-(|6?8)) is extremely general (jEvans et al. IQQSllGaUavotti et al. 19951 
IKurchan 19981 ILebowitz et al. 1999p . independent on the model's parameters, and 
valid for several types of dynamics. Two different settings have to be distinguished: 

• Transient: Each measurement of at is made starting from a thermaliscd system at 
temperature T, a configuration chosen with the Gibbs-Boltzmann distribution. At 
time t = 0, non-conservative forces are switched on, at is proportional to the work 
they make during a time t, which need not be long. The system may be isolated 
or connected to a thermostat, which then has to be at the same temperature T. 

• Stationary: Here, the sampling is of a system that by assumption has achieved 
stationarity in the presence of forcing. For this to be possible it needs a thermostat 
to absorb heat. The system is not in equilibrium, and the fluctuation relation is 
valid only in the limit of long sampling periods t 00. 

Another distinction we can make is whether the dynamics is stochastic (e.g. Langevin) 

or deterministic (the Gaussian thermostat (|1.13p without energy-conserving noise). 

The only hard case is the stationary and deterministic one, the Gallavotti-Cohen (jGallavotti et al. 1995^ 

theorem. It is not only technically more subtle, but it relics on a real physical condition 

that the system has to meet, as wc shall sec. 

6.1.1 Transient, with or without bath 

Equation (|2.24p can be rewritten, for all fi (for the moment an arbitrary number): 



G(m) = G(l - m) 



(6.8) 




(6.9) 



Using the expression (|4.24p for the large deviation function, starting from the Gibbs- 
Boltzmann distribution \GB), we compute: 



= (-|e-^«e-*K-^H%/3«|Gi?) 

= (-|ne"*K'+^''-p]n-i|GB) 

= (-|e-*K-^f-p] \GB) = e-*G(i-^) (6.10) 



i.e. (j6.8p . Note that wc did not have to use any assumptions cither on times or on the 
dynamics. 
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6.1.2 Stationary with bath 

To compute the large-deviation function for long times, we proceed as in the previous 
section, and introduce eigenvectors and eigenvalues as in (|4.25p : 



Hk - -v 



1^4') ^ Kl^i") ; ii'tl HK-^i.v (6.11) 



to get: 

= (-|e-*K-Tf-v]|zmf) = V(-|V;f)(V'f Kmi) e-*^''^'') (6.12) 



e 



If the spectrum has a gap, the eigenvalue with the lowest real part dominates, and we 
have that G{^) = Ami„(/i) as t — !■ oo. 

Now, because \Hk — Tf-v] and \Hk — ''^f-v]^ are related by a similarity trans- 
formation (|6.9p . their spectra are the same. Hence, we have that, to the extent that to 
leading order in the time G(/i) depends only on eigenvalues and not on eigenvectors: 

G{^l) = G{l-^l) (6.13) 

In this case, ()6.13p is valid only at large times. Where can this fail? The problem in the 
large-time evaluation of (|6.12p arises if {tj:^ {\min)\init) ~ 0. At zero noise intensity 
this may well happen, because eigenvectors may in that case be completely localised. 

6.1.3 Gallavotti-Cohen theorem 

We shall not derive here the Gallavotti-Cohcn theorem, but just say a few words. 
The Gallavotti Cohen Theorem is the stationary fluctuation theorem for a system 
in contact with a deterministic Gaussian thermostat, like the one we introduced in 
(|1.13|) but with exactly zero noise. It turns out that unlike the stochastic and the 
transient case, which are essentially always valid, the Gallavotti-Cohen result breaks 
down in some systems, and in most cases when the forcing is very strong. This is not 
a defect of the proof, but a reflection of a physical fact: the Fluctuation Relation in 
the deterministic case holds only if the system has certain 'ergodic ' properties. 

To have a perspective on this, one can consider approaching the deterministic case 
as a limit of the noisy thermostatted case (|1.13p . One can reproduce the derivation 
above for this case (jKurchan 2007p . and easily conclude that for every level of noise 
e the theorem is valid, in the limit of large time windows t 3> tmini^), for some 
time tmm(e) • The trouble comes from the fact that as e ^ it may happen that 
^miri(e) diverges: in other words, the time imm(e) needed for the correct sampling of 
fluctuations may become infinite in the deterministic limit. The result of Gallavotti 
and Cohen proves that this is not the case for a class of very chaotic systems. For this 
class their theorem implies that limg^o imm(e) < oo. 

The timescale imm(e) has a clear physical meaning, which we just hint at here. 
A driven, deterministic, system has an attractor on the energy surface (see sect 2.4). 
Winding back in time the dynamics defines a repellor which is stationary but unstable. 
In chaotic Hamiltonian (undriven) systems, attractor and rcpcUor are intertwined, 
they occupy the same region in phase-space. As the drive is turned on, the attractor 
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distribution focuses on a region the energy shell, as described in Sect. 12.41 So does the 
rcpellor, in a region that may be non-overlapping. 

In the presence of weak noise, even if attractor and repellor are in principle separate, 
the system may occasionally switch from attractor to repellor and back: this is exactly 
analogous to the 'coexistence' we found in lecture 5. The typical time to do this 
( (jKurchan 2007p . see also (jBonetto et al. 1997aP ) is precisely tmm(e) for an otherwise 
ergodic system. The condition for tminif) to remain finite as the noise goes to zero 
is then that attractor and repellor overlap sufficiently in phase-space that it needs 
no noise to jump from one to the other. This is an extra condition the system must 
satisfy in order that the Gallavotti-Cohen fluctuation theorem to holds. In a word, the 
applicability of the Gallavotti-Cohen fluctuation theorem is, for a chaotic deterministic 
system, a symptom that attractor and repellor have not divorced under the effect of 
forcing. 

6.2 Jarzynski's equality 

Jarzynski's equality is a remarkable generalisation of the second principle. Consider 
a system with an energy dependent upon a parameter (volume, magnetic field, ...) 
which we denote a. We start from a Gibbs-Boltzmann equilibrium corresponding to 
parameter ainiuai and then evolve while changing a{t) at arbitrary speed up to a time 
t with a final = o:{t). The equality is then: 

g-/3[F(aj.„„,)-i^("...t.o,)] = l^^-P work ) ^^^^^^^^^^ (6.14) 

The average is over trajectories starting from equilibrium at t = 0, but otherwise 
arbitrary. Note the surprising appearance of F [a final), o.n equilibrium quantity, despite 
the fact that the system is not in equilibrium at time t. We shall prove this result for 
a Langevin system with inertia. It is valid independently of the friction coefficient 7, 
indeed even if 7 = and the system is isolated. 

From (|6.14p we can go to the second principle, taking logarithms on both sides and 
using Jensen's inequality (e^) > e^^'* 

F{afinai) - F{a.initiai) < work (6.15) 
f The time-dependent energy is: 

7^„=7^(q,p,a) = ^^ + y(q,a) (6.16) 

i 

If the parameter a depends on time, it docs work: 

' force X velocity = energy change — work 

dt 



dt 



We assume we start from the equilibrium configuration \GB{ai)) corresponding 
to a given value ai. The total evolution over a time t can be written by breaking the 
time in short intervals as in Fig. 16.11 
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U{t) = e-'^tti^M „,e-^tH^2e-^*"-^i (6.17) 
Because {—\Ha = (— |, we have 

1 = (-|e-**«°M...e-**^°^e-**«°i|GB(ai)) - {GB ia,)\e-'*< e-''"«2 ...er'^^'^M \-) 
= ^(_|e-/3«=: e-*'*^°ie-*'*^°^...e-'*^°M e/^^-M |Gi3 (6.18) 

As in all these theorems, we wish to introduce time-reversal. We proceed as follows: 
we insert factors between every two exponentials 

(^e/3«=2e-'3'"°2) 

e-/3Hoi e~^*^°i T i e-^*^°2...e-^*^°A/ e^^^-M (6.19) 

and use in each factor (|2.9[) : 

e-'^^-e-^^^-e^^-- =ne"**«-n-i (6.20) 

Putting everything in (|6.18p . using that the — 1 and that e"'^*-^"^ \GB (aA/)) — 
\GB (om)) we get: 

1 ^ ^(„|e~*-*^»ie-^(^°i-^°2) e-**^=2e-/3(«»2-«=3)...e"**^=M|G'i3 («m)) 

(6.21) 

Because Ha, - Wa(r+i) fx Sa, and using, as usual, that e-^'^e**-^ - g5t(A+B)+o[(5(t))'] 
for small St: 

= ^(^le-**^.^!-/'^*" e-**^"2-/3^^"...e-^*^"M-/3^^^;^*"|Gi? (aA/)) (6.22) 
which, by simple comparison means that 

where the average is over trajectories starting from initial points chosen with the 
equilibrium distribution at (aA/) and ending anywhere where evolution and noise takes 
them. 

We finally get 

e-^(^^---^"""-) = {e-^-^'' '^)o.„..„,^ any (6.24) 

The average is over trajectories starting from equilibrium at ainiuai and ending any- 
where that the dynamics takes them. The interpretation of work — J dt ^ has 
generated controversy IjPeliti 2008P , it is the work done by the system on the external 
sources (e.g. pistons, etc) of the 'fields' a. Here again, subtly different equalities can 
be obtained depending on the precise expression used for work, see ( [Jarzynski 2007[ ). 
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Fig. 6.1 Breaking the time-interval. 

6.3 A paradox 

The Jarzynski equality has been criticised (jCrooks et al 200"7|) on the basis that it it is 
supposed to fail for a process of free expansion. This paradox was completely resolved 
by Crooks and Jarzynski. The resolution is in itself instructive, because it highlights 
the role of rare fluctuations. The argument goes as follows: if we open the tap of a 
bottle with gas, letting it freely expand into an empty room, there is no work done, 
and yet there is a change in the free energies before and after: Jarzynski's equality 
must be violated. 

Let us first distinguish two ways of making a free expansion: with a sliding wall 
(Fig. 16. 2[) or with a rapidly receding piston (Fig. 16. 3[) . In the case of the sliding wall 
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1 















Fig. 6.2 Sliding wall. Left and right: nonequilibrium and equilibrium initial conditions, 
we see immediately where is the catch: we arc supposed to start with an equilibrium 
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initial condition, but this requires that the right-hand compartment in Fig. l6.2l be also 
full. If such is the case, when we slide the wall open there is neither work done nor 
free-energy change. 

This seems like a cheat, because we could have a solid piston receding infinitely 
fast as in Fig l6.3l and in that case no need of an equilibrated right hand compartment. 
Following Crooks and Jarzynski, suppose then that the piston is pulled backwards a 




Fig. 6.3 Pulling back a piston fast. The process and its time-reversed. 

huge velocity v. U v '3> {kTY^^, it is highly unlikely that any gas particle will hit the 
piston as it recedes, so in almost any run of the experiment no work is done. There are, 
however, very rare realisations of the experiment in which an unusually energetic par- 
ticle catches up with the piston, bounces, and looses most of its velocity in the process. 
This can be best seen if one considers the time-reversed process (Fig 16. 3p in which a 
rapidly incoming piston hits a slow moving particle. It turns out (jCrooks et al 2007| 
that the rare realizations in which this strong 'particle cooling' happens suffice to 
account for the exponential of the free energy difference. 

This example beautifully illustrates the role of rare fluctuations in Jarzynski's 
equality, and the extent to which what dominates the equation can be an extremely 
atypical process that violates our intuition, which is designed to apply to probable 
events. 



6.4 Experimental work 

A serious account of both the Fluctuation theorem and Jarzynski's equality should 
discuss the, by now quite considerable, experimental work. We shall not do this here, 
but just refer the interested reader to the References (jRitort 2007^ . 

Let us make however a few remarks, from a theoretician's perspective. The fluc- 
tuation theorem is, as its name suggests, a theorem. As such it need not be tested 
experimentally. The question is, of course, to what extent a specific physical system 
satisfies the hypotheses. Clearly, no real system has a Gaussian thermostat. Even a 
Langevin thermostat is an idealisation - a question that has been brought up in the 
context of Jarzynski's equality, (see below). The usual way out is to say that the 
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thermostat is 'far away' and then its nature becomes irrelevant. This would normally 
sound extremely reasonable, but bear in mind that, as the previous section shows, 
intuition may be misleading when applied to large deviations. At any rate, if the na- 
ture of the thermostat is indeed not important, then we might as well suppose it is a 
stochastic one, so that the fluctuation theorem holds without any assumption related 
to crgodicity. 

Consider for example the Lyon experiment (jCiliberto et al 1998P where a liquid 
is enclosed between two horizontal plates, the lower at higher temperature than the 
upper one Td> T^. Heat is transmitted by (Rayleigh-Benard) convection. The fluctu- 
ation theorem establishes a relation between having a heat flux J7 in one sense and in 
the other, P{J)/P{-J) - e^^''"-^'')'^* (jEckmann et al^ . Surely, one argues, the top 
and bottom plates can be considered to be in contact with a good Langevinian bath, 
the nature of Rayleigh-Benard convection cannot depend on that. If this assumption 
is correct, then the validity of the fluctuation theorem is not in doubt, as no crgodicity 
properties are required of the system when there is a Langcvin bath. But in fact, the 
experiment does not find a fluctuation relation with the true top and bottom tem- 
peratures (and indeed, with such temperatures the violations of the second law would 
be unobservable) . Why is that?. It would seem - but this needs further elucidation 
- that what the experiment is in fact testing is a pre-asymptotic result, because the 
time-window is not long enough. One can of course ask if at this pre-asymptotic level 
there is for some other reason another fluctuation relation, with a higher effective 
temperature, but this requires a different theory (jBonetto et al. 1997ap . 

Let us now turn very briefly to the Jarzynski equality. In this case, experimental 
work (jRitort 2007P has been made not so much to test the equality, as in the case of 
the fluctuation theorem, but to use it to evaluate free-energy differences with fast, out 
of equilibrium, measurements. For example, two RNA strands are unzipped by pulling 
with optical tweezers, and the work that this costs is measured (jRitort 2006p . The 
experiment is repeated many times, and from the work distribution the equilibrium 
free-energy difference is measured. If we move too fast, it is the rarer and rarer runs 
that dominate, so the experiment has to be repeated many times. Here again, the 
example of the piston in the previous section is very illuminating: if the piston recedes 
very fast, only in very rare repetitions of the experiment will a fast particle catch up 
with it and be stopped: but this event dominates the average in Jarzynski's equality!. 

Let us say a final word concerning Jarzynski's equality in the presence of a (water) 
bath. It has been argued (jCohen et al 2004)) that the water can not be considered as a 
Langevinian - or in fact as any other equilibrium - bath, while the system is evolving 
fast. This is indeed so, but the problem is solved ( [Jarzynski 1997[ ) by 'moving the bath 
away', i.e. by including the surrounding water as part of the system. 
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